ORIENTATION
Examiner: Chan
April 20, 2004

750 B STREET, Suite 3120
San Diego, CA 92101



OFFICIAL 



DECLARATION UNDER RULE 131 

Commissioner of Patents and Trademarks 
Washington, DC 20231 

Dear Sir:

I, Arnon Amir, declare as follows:

I am one of the original inventors of the invention claimed in the above-captioned application.
As evidenced by the enclosed IBM Invention Disclosure form dated as being created on March 28, 2000
and modified on April 5, 2000, I and my co-inventor conceived of the present invention at least prior to December
4, 2000. Specifically, using the limitations of Claims 1 and 6 as an example and referring to the enclosed
document, we conceived of a computer-implemented method for generating a gain adjust signal to establish an
audio output level that includes receiving a person-microphone position signal which represents a position of a
person relative to a microphone (page 2, paragraph number 2, last sentence). The method we conceived also
includes determining a gain adjust signal based at least in part on the person-microphone position signal (id.), and
using the gain adjust signal to establish the audio output level (page 3, Figure 1). In our conception, the person-
microphone position signal can be derived from a motion sensing system or a position sensing system or an
orientation sensing system or a distance sensing system (third page of enclosed document, labeled "page 4",
bottom of page). The disclosure also teaches several additional features of one or more dependent claims.

I declare that we were diligent in reducing the invention to practice at least from a time prior to December
4, 2000. Specifically, I declare that we submitted the enclosed invention disclosure prior to December 4, 2000 to
IBM Intellectual Property Department, which then diligently processed the application for disclosure to outside
counsel and subsequent filing within the usual course of IBM business in filing patent applications.

I hereby declare that all statements made herein of my own knowledge are true and that all statements made
on information and belief are believed to be true; and further that these statements were made with the knowledge
that willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section
1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of the
application or any patent issued thereon.

BY: ARNON AMIR 

Date: [illegible]
Response Due to IP&L: 05/05/2000



Main Idea



Automatic Gain Control of Audio Level by Speaker Motion Analysis 
Idea of disclosure

[...] the problem solved (if appropriate) and [...]

The present innovation is in the field of video and audio processing.
It concerns the undesired variation of the recorded audio level of speech
due to speaker motion and speaker head rotation. It presents a method
to automatically compensate for these changes in the recorded audio
level by examining the speaker movement in the video, detecting his relative
position and head rotation, and compensating for the audio level accordingly.

The innovation applies to many fields where audio is being transferred by microphones,
from teleconferencing and video conference, lecture videotaping, distance learning,
remote education and training, to cellular and other mobile phones and communication
[...], music and sound recording, radio, television and other media. In all these fields
[...] microphones, and so they could leverage a "smart" microphone
whose sensitivity is adaptive to the speaker motion.

How does the invention solve the problem or achieve an advantage (a description of "the
invention", including figures inline as appropriate)?
When a speaker or lecturer is recorded, a microphone is used to receive
the voice audio signal into a recording device. The microphone is typically
fixed on a stage, wired to the speaker's clothes, or held by hand. When the
speaker is moving, walking, leaning or rotating his head, the distance
and direction to the microphone change, and therefore the received audio
level varies.

The recorded audio level gets higher as the microphone gets closer to the
speaker's mouth. Similarly, when the head is directed away from the microphone,
the audio level decreases. These changes result in quality degradation
of recorded audio, and have direct impact on the quality of audio and video
playback, on audio analysis (e.g., speech recognition) etc. The process
described herein measures the speaker's physical position and uses it to
compensate for the undesired changes in audio level by changing the
audio gain accordingly.
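
The compensation described above can be illustrated with a minimal sketch.
It assumes a free-field inverse-distance law (level drops about 6 dB per
doubling of distance); that mapping, the reference distance, the clamp and
the function names are illustrative assumptions and are not taken from the
disclosure, which leaves the mapping function open.

```python
import math


def compensation_gain_db(distance_m, reference_m=0.3, max_gain_db=12.0):
    """Gain in dB that offsets the level change caused by speaker distance.

    Assumption (not from the disclosure): free-field inverse-distance law,
    so the level falls by 20*log10(distance/reference) dB.
    """
    if distance_m <= 0 or reference_m <= 0:
        raise ValueError("distances must be positive")
    gain_db = 20.0 * math.log10(distance_m / reference_m)
    # Clamp so that tracking errors cannot produce extreme boosts or cuts.
    return max(-max_gain_db, min(max_gain_db, gain_db))


def apply_gain(samples, gain_db):
    """Scale linear audio samples by a gain expressed in dB."""
    factor = 10.0 ** (gain_db / 20.0)
    return [s * factor for s in samples]
```

With these assumptions, a speaker at twice the reference distance receives
roughly 6 dB of boost, and a speaker at the reference distance receives none.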

It is important to note that changes in audio level happen not only
due to changes in the speaker's physical position but also due to intentional
and unintentional speech changes by the speaker (e.g., to emphasize a
certain sentence), due to language switching, different speakers etc.
Therefore, changing the audio gain to make the audio level constant will
distort the original, will cause the loss of information embedded
in the intended speech level change, and hence is undesired. The purpose
of the present innovation is to isolate the amount of audio level
change that corresponds to [illegible] microphone [illegible] and [illegible].

An audio input comprises a machine-readable audio signal which contains speech
by a speaker. A video input comprises a machine-readable moving
pictures signal in which the speaker is shown, in part or in whole.
Given an audio and a video stream, the proposed process changes
the audio level using a process comprising the following parts:

1. means of audio and video input

2. means of audio output 

3. means of detecting the speaker physical position in the moving images 

4. means of determining the audio gain according to the detected motion 

5. means of changing the audio gain between audio input and output 
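
The five parts listed above can be sketched as a simple per-frame loop. This
is an illustration under stated assumptions, not the authors' implementation:
the names run_gain_control, detect_position and gain_for_position are
hypothetical, and the position detector and gain mapping are supplied by the
caller because the list does not fix them.

```python
from typing import Callable, Iterable, Iterator, List, Tuple

# One time step: (audio samples, video image). The image type is left open.
Frame = Tuple[List[float], object]


def run_gain_control(
    frames: Iterable[Frame],
    detect_position: Callable[[object], float],
    gain_for_position: Callable[[float], float],
) -> Iterator[List[float]]:
    """Yield gain-adjusted audio for each synchronized audio/video frame."""
    for audio, image in frames:              # part 1: audio and video input
        position = detect_position(image)    # part 3: speaker position
        gain = gain_for_position(position)   # part 4: gain from position
        # part 5 applies the (linear) gain; the yielded list is part 2.
        yield [sample * gain for sample in audio]
```

In practice the hypothetical gain_for_position callable would hold whatever
mapping function is chosen, so the loop itself does not change when the
mapping does.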

Clearly, other mapping functions can be used. 

[...] by setting the gain of an analogue audio amplifier, by changing an
audio attenuator, or by other means of audio level control.

[...] speaker physical position and audio [...] audio and video signals.
One such method is broken into three steps. In the first step, part of the
video and audio signals are analyzed [...]. The measures are temporarily
stored in memory [...]. The relationship between the measured speaker
position and the measured audio level is explored, and a mapping [...]



Figure 3: Block diagram for three stages processing (recoverable block labels: digital video, speaker detection, read gain control model)



[...] signal for measuring it, there are many other ways [...], including
motion, orientation and position sensors wired to the speaker [Media-lab],
three dimensional mapping by a laser range finder [laser], structured light
[structured-light], etc.